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Abstract 

This paper addresses joint transceiver and relay design for a wireless multiple-input-multiple-output 
(MIMO) switching scheme that enables data exchange among multiple users. Here, a multi-antenna 
relay linearly precodes the received (uplink) signals from multiple users before forwarding the signal 
in the downlink, where the purpose of precoding is to let each user receive its desired signal with 
suppressed interference from other users. The problem of optimizing the precoder based on various 
design criteria is typically non-convex and difficult to solve. The main contribution of this paper is a 
unified approach to solve the sum mean square error (MSE) minimization and sum-rate maximization 
problems in MIMO switching. Specifically, an iterative algorithm is proposed for jointly optimizing the 
relay's precoder and the users' receive filters to minimize the weighted sum-MSE. It is also shown that 
the sum-rate maximization problem can be converted to an equivalent weighted sum-MSE minimization 
problem and can therefore be solved using a similar algorithm. Asymptotic analysis reveals that, with 
properly chosen initial values, the proposed iterative algorithms are asymptotically optimal in both 
high and low signal-to-noise-ratio (SNR) regimes for MIMO switching, either with or without self- 
interference cancelation (a.k.a., physical-layer network coding). Numerical results demonstrate that the 
optimized MIMO switching scheme based on the proposed algorithms significantly outperforms existing 
approaches in the literature. 

Index Terms 

Beamforming, linear precoding, MIMO switching, minimum mean square error (MMSE), physical- 
layer network coding, relay. 

I. INTRODUCTION 

Physical-layer network coding (PNC) has received much attention in recent years [2]. The 
simplest communication model for PNC is a two-way relay channel, in which two users ac- 

This work was presented in part at the IEEE International Symposium on Information Theory, Cambridge, MA, July 2012 
[1]. 
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complish bidirectional data exchange in two phases of transmission with the help of a relay. 
Significant progress has been made in approaching the ultimate capacity limit of two-way relay 
channel (cf., [2]-[8] and the references therein). 

More recently, multi-way relaying, in which multiple users exchange data via a single relay, 
has been studied [9]-[20]. In [9], [18]-[20], the authors studied such a system where the relay is 
equipped with a single antenna. The use of multiple antennas at the relay provides extra spatial 
degrees of freedom that can significantly enhance the network throughput. A multi-antenna relay 
that performs one-to-one mapping from the inputs to the outputs (i.e., that switches traffic in 
a one-to-one manner among the end users) is called a MIMO switch [10], [11]. Various traffic 
patterns have been studied in MIMO relaying, including pairwise data exchange [1 1]— [13], where 
the users form pairs and data exchange is within each pair, and full data exchange, where each 
user broadcasts to all other users [14]— [16]. The authors in [10] further generalized pairwise 
data exchange to arbitrary unicast, in which each user sends data to one other user and could 
receive data from a different user. Arbitrary unicast is interesting because we can realize any 
traffic pattern, including unicast, multicast, broadcast, or any mixture of them, by scheduling 
multiple arbitrary unicast flows. 

This paper is concerned with joint transceiver and relay design for MIMO switching with 
arbitrary unicast. Several pioneering works in this direction have been reported in the literature 
[10], [12], [13], [15], [17]. Zero-forcing relaying was first proposed in [12] to realize pairwise data 
exchange. In [17], the authors showed that zero-forcing relay with PNC, which employs self- 
interference cancelation, can considerably improve system throughput. However, zero-forcing 
involves channel inverse operations that could lead to significant power penalties when the 
channel gain matrix is ill-conditioned. To alleviate power penalties, the authors of [12] and 
[15] proposed minimum- mean- square-error (MMSE) relaying, which achieves better performance 
in the practical signal-to-noise ratio (SNR) regime. However, the improvement of the MMSE 
relaying therein is limited since: 1) the proposed scheme fixed the receive filter to a scaled 
identity matrix and optimized the precoder only; and 2) the proposed scheme does not exploit 
the benefit of network coding. To overcome the two limitations, we propose an MMSE relaying 
scheme that uses network coding and that jointly optimizes the receive filters. 

The transceiver design in the aforementioned works were based on a variety of criteria, such 
as zero forcing and MMSE. However, throughput is more directly related to user experience. 
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Unfortunately, throughput maximization problems are often non-convex, and hence difficult to 
solve. In this paper, we investigate system throughput through a sum-rate optimization problem. 

Our investigation indicates that two families of optimization problems, namely, sum-MSE 
minimization and sum-rate maximization can be tackled under one unified framework. Specif- 
ically, to minimize sum MSE, an algorithm can be devised to iteratively optimize the relay's 
precoder and the users' receive filters. We show that the sum-rate maximization problem can be 
converted to an equivalent weighted sum-MSE minimization problem. Thus, the problem of sum- 
rate maximization can also be solved in an iterative manner. In the low and high SNR regimes, 
we provide analytical results for the properties of the asymptotically optimal solutions and the 
convergence conditions of the proposed iterative algorithms. We show by numerical results that 
the proposed algorithms significantly outperform the existing approaches in the literature [10], 
[12], [15], [17]. 

The main contributions of this paper are 

1) a systematic design of unicast MIMO switching, with or without PNC, based on two 
widely adopted criteria, i.e., sum-MSE minimization and sum-rate maximization, and 

2) an asymptotic analysis which reveals that, with properly chosen initial values, the proposed 
iterative algorithm are asymptotically optimal in both very high and very low SNR regimes. 

The remainder of the paper is organized as follows: Section II introduces the background of 
wireless MIMO switching. Sum-MSE minimization and sum-rate maximization are discussed 
in Section III and IV, respectively. In Section V, asymptotically optimal solutions are derived. 
Section VI presents simulation results. Section VII concludes this paper. 

We adopt the following notational convention: Scalars are in normal fonts; boldface lower- 
case letters denote vectors and boldface upper-case letters denote matrices; diagja?} denotes a 
diagonal square matrix X with the elements of x being the diagonal elements; diag{X} denotes 
a column vector x with the diagonal elements of X being the elements. [^]dia g represents a 
diagonal matrix with the same diagonal elements as X. Denote by C a ,b the covariance of two 
zero-mean random variables a and b, i.e., C a 6 = E[a6*]. The operation vec(X) is to stack each 
column of the matrix X on top of the right adjacent column; mat(-) is the inverse operator of 
vec(-); denotes Moore-Penrose pseudo inverse [21]; and <g> denotes the Kronecker product. 
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Fig. 1. Wireless MIMO switching. 

II. System Description 

Consider K users, numbered from 1 to K, each equipped with a single antenna. These users 
communicate via a relay with N antennas and there is no direct link between any two users. 
The overall system is illustrated in Fig. 1. Throughout the paper, we focus on the pure unicast 
case, in which each user transmits to one other user only. Let 7r(-) specify a switching pattern, 
which can be equivalently represented by a permutation matrix P. 1 Let ej denote the jth column 
of an identity matrix. Then the ith column of P is equal to ej, i.e., p i = e T (,;j = ej. If the 
diagonal elements of permutation P are all zero, it is also called a derangement. In particular, 
a symmetric derangement (P = P T ) realizes a pairwise data exchange. Thus, the pure unicast 
pattern can be represented as follows: user i transmits to j = for every i E {1, - • • , K}. 
Any general traffic flow pattern among the users can be realized by scheduling a set of different 
unicast traffic flows [10]. 

Each round of data exchange consists of one uplink phase and one downlink phase. The uplink 
phase sees simultaneous transmissions from the users to the relay; the downlink phase sees one 



'A square matrix P is a permutation matrix if it has one and only one nonzero element on each row and each column, which 
is equal to 1. 
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transmission from the relay to the users. We assume that the two phases are of equal duration. 

In the uplink phase, let a; = [x± : --- ,xk] t be the vector representing the signals trans- 
mitted by the users. Let y = [yi, • • • ,Dn] t be the received signals at the relay's antennas, 
and u = [ui, ■ ■ ■ ,un] t be the noise vector with independent and identically distributed (i.i.d.) 
samples following the circularly- symmetric complex Gaussian (CSCG) distribution, denoted by 
£A/"(0,7 2 ), where 7 2 is the noise variance at the relay. Then 

y = Hx + u, (1) 

where H E £, NxK [ s the uplink channel matrix. We assume that all uplink signals are indepen- 
dent with zero mean and unit average power, i.e., E,{xiXj} = and E{x 2 } = 1, % — 1, • • • , K, 

3 ^ i- 

Upon receiving y, the relay precodes y with matrix G and forward Gy in the downlink phase, 
where the transmit power of the relay is upper-bounded by P r , i.e. 

Tr [G (HH H + 7 2 /) G H ] < P r . (2) 

The signals received by all users are collectively represented in the vector form as 

r = FGy + w = FGHx + FGu + v, (3) 

where F is the downlink channel matrix, and v is the noise vector at the receiver, with i.i.d. 
samples following the CSCG distribution, i.e., v k ~ CAf(0,a 2 ), where a 2 is the noise variance. 
The received signal r 7r ( i ) at user 7r(i) is used to recover the message from user i. Each user scales 
its received signal before estimation. We use a diagonal matrix C to denote the combination of 
all the scaling factors. Together with the definition of the permutation P previously, we could 
achieve perfect switching if CFGH = P in the absence of noise. In the presence of noise, G 
and C are chosen to optimize certain performance metric. The purpose of the MIMO precoder 
G is to manage interference based on the knowledge of the uplink and downlink channels H 
and F. A network coding technique can be used, assuming that users can cancel self-interference 
in the received signal. We refer to the precoding schemes without network coding as non-PNC 
schemes, and subsequent ones with network coding as PNC schemes. 
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III. Sum-MSE Minimization 

A. Problem Formulation 

We start with the sum-MSE minimization problem. Let user j scale its received signal to 
yield fj = CjVj. In the absence of network coding, the goal is to choose the precoder and scaling 
coefficients ci, • • • , c K so that fj is as close as possible to x { for every sender % and corresponding 
receiver j = ir(i). 

In the case where network coding is allowed, the goal is to make fj close to Xi after removing 
self-interference. The decision statistics for all K users can be expressed in vector form as Cr — 
Bx, where B is a diagonal matrix consisting of the weighting factors of self-interference. The 
weighted sum square-error is then given by \\W* (Px — (Cr — Bx)) || 2 , where || ■ || represents 
the Euclidean norm, and Wis a diagonal matrix with the ith diagonal element being the weight 
of the MSE at user %. 

Based on the above, the weighted sum-MSE minimization problem is formulated as 

2 



minimize E 

G,B,C 



(4a) 



((P + B)x - Cr 

subject to Tr [G (HH H + 7 2 l) G H ] < P r (4b) 
B and C are diagonal. (4c) 

The case of no network coding admits the same formulation except that the matrix B is set to 
0. Problem (4) involves the joint optimization of G, B and C. Unfortunately, this problem is 
non-convex, and thus is in general difficult to solve. We next propose an iterative algorithm to 
find a suboptimal solution to (4), as described below. 
Following the convention in [12], [22], we define 

G = a~ x G and C = aC, (5) 

where a is a scaling factor. Later, we will see that a is introduced to meet the power constraint 
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at the relay. Then the sum-MSE (4a) is expanded as 



J(G, a, B, C) =Tr 



W [ I + BB H - m{CFGH{P + B) H } 



+ CFG ( ////" + 7 2 /) G H F H C H + a 2 a- 2 CC H 



(6) 



We notice that, for fixed (B, C), both J(G, a, B, C) and the power constraint (4b) are quadratic 
with respect to (w.r.t.) G or a, which is a solvable quadratically constrained quadratic program 
(QCQP). On the other hand, for fixed (G,a), problem (4) also reduces to a solvable QCQP 
w.r.t. (B,C). In fact, both optimization problems admit explicit analytical solution. Thus, we 
can iteratively optimize (G, a) and (B, C), yielding an approximate solution to the problem in 
(4). 

It is worth noting that setting B = and C = I reduces the problem in (4) to the conventional 
MMSE relaying design problem (without PNC) studied in [12], [15]. We will show later that 
the flexibility of choosing B and C provides significant performance gains. 

B. Optimal (G, a) for Fixed (B, C) 

For fixed (B,C), the problem (4) reduces to 

minimize J(G,a) (7 a) 

G,a 



subject to « 2 Tr 



G (HH H + 7 2 I) G H j < P r . (7b) 



In the above, J{G, a) is a shorthand of J{G, a, B, C) for fixed B and C. Similar notation will 
be used throughout without further notice. We use the Lagrangian method to solve the problem 
in (7). The Lagrangian function is written as 

£(G, a, X) = Tr {w [l + BB H - 2^{CFGH{P + B) H } + CFGHH H G H F H C H 

+ 7 2 CFGG V^C" + a 2 a- 2 CC H ] } + A (Tr[a 2 G (HH H + 7 2 /) G H ] - P r ) , 

(8) 
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where A is the Lagrangian multiplier. By setting |^ = |^ = 0, we obtain 

^ = a> = * (9) 

d\ Tr[G(HH H + ^I)(G) H Y K) 

dC _ 4 _ a 2 Tr[WCC H ] 

^ a ~ ATr [G {HH H + ^I) (G) H ] ' ( } 

Hence, 

2 

\a 2 = ^-Tr[WCC H ]. (11) 

By setting ~ = and using (11), we obtain the optimal precoder G opt . Hence the following 
proposition: 

Proposition 1. For fixed (B, C), the optimal precoder to problem (7) is G opt = aG° pt with 
G° pt = (^Tr[WCC H }I + F H C H WCF^J F H C H W(P + B)H H (HH H + 7 2 {)2a) 
= p) (Tr \p opt (HH H + 7 2 l) (G° pt ) H j ) 1 . (12b) 
As mentioned earlier, a in (12b) is just a scaling factor to meet the relay power constraint. 

C. Optimal (B, C) for Fixed (G, a) 

For fixed (G, a), the optimization problem (4) reduces to an unconstrained problem as follows: 

minimize J(B,C). (13) 

B,C 

Proposition 2. For fixed (G,a), the optimal solution to (13) is given by 

B opt =D 1 (D 2 - D»D l )- 1 [PH H G H F H ] diag , (14a) 

C° pt =(D 2 - D?D i y 1 [PH H G H F H } diag , (14b) 

where 

D x = [FGH] diag e C KxK , (15a) 

D 2 = [FG (HH H + 1 2 I) G H F H + a 2 a' 2 I] diag e C KxK . (15b) 
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Proof: The sum MSE in (6) can be rewritten as 

J(B, C) = Tr \w + WBB H - 21k{WCS + WCD X B H } + WCD 2 C H ] (16a) 
= Tr[W] + ||W5(B-DiC)|||. - 2^{Tr[WCS]}+Tr[WC H {D 2 -D?D 1 )C](16b) 
= Tr[W\ + \\W*(B - D x C!)f F + \\W^{D Z C - D?S)f F - Tr[S H DfS], (16c) 

where (16a) follows by noting (15) and letting 

S = [FGHP T ] diag , (17) 



and (16b) is obtained by factorizing w.r.t. B, and (16c) follows by letting D 3 = (D 2 — -Df D 1 )z 
and factorizing w.r.t. C. In the above, D 3 = (D 2 — D± Di)^ is well defined by noting that 

[D 2 ] u >[FGHH H G H F% > [FGH]? [FGH] U = [D± D^u, i = l,...,K, (18) 

where step (a) follows by the definition in (15b), and step (b) follows (15a). From (16c), we 
see that the sum MSE is minimized when 

C = DfS and B = D 1 C, (19) 

which concludes the proof. ■ 
From (14), we notice that 

= D^ 1 = [C° pt FGH} diag . (20) 

That is, B opt is exactly the self-interference weights in the filtered signal Cr. This implies that, 
with B opt given in (14a), the self-interference is perfectly canceled at the user ends. 



D. Overall Iterative Algorithm 

The overall iterative algorithm is outlined by the following pseudo-code. 

Algorithm 1. 
i: Initial: B = B , C = C ; 

2: while the MSE can be reduced by more than e do 
3: Compute G and a using (12); 
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4: Compute B and C using (14); 
5: end while 

The convergence of Algorithm 1 is guaranteed as follows. It can be seen that the weighted 
sum MSE decreases monotonically in the iterative process. Together with the fact that the 
weighted sum-MSE is non-negative, we conclude that the sum MSE converges in iteration. 
From Proposition 2.7.1 in [23], it can be further shown that this convergence point is a local 
optimum of the original problem (4). The local optimum at which Algorithm 1 stops in general 
depends on the initial condition (B ,Co). We shall discuss the choice of the initial condition 
in Section V. 

IV. Sum- Rate Maximization 

In this section, we study the sum-rate maximization, which is another widely used metric in 
optimizing the performance of wireless networks. We show that the sum-rate maximization 
problem can be converted to an equivalent weighted sum-MSE minimization problem, and 
therefore, the results in Section III can be easily adopted to this case. 

A. Problem Formulation 

For reasons to be clarified later, we now use a diagonal matrix A, instead of B, to represent 
the weights of self-interference to be canceled. Note that letting A = yields the conventional 
non-PNC problem. Then, after self-interference cancelation, the signal becomes r — Ax. To 
simplify the index mapping of the received signal vector, let 

z = P T (r - Ax). (21) 

The ith element of z is given by 

Zi = pjFGh lXl + J2 pjFGhex e + pjFGu + pjv - pjAx, (22) 

where hi is the ith column of H. In this way, z% is the received signal of user for the 
recovery of the message from user %. Thus, the achievable rate of user % is given by 

1 / \p T FGh-\ 2 \ 

Ri = - log ( 1 + T (FGH _ A )\2_ \ v T FGh \2 , ~2|pT FG |2 + a 2 ) > ^ 
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where the factor 1/2 is due to the two-phase transmission. Our purpose is to maximize the sum 
rate under the power constraint of the relay. This optimization problem is formulated as 

K 

maximize R t (24a) 

G ?=i 

subject to Tr [G (HH H + 7 2 l) G H ] < P r . (24b) 

The problem (24) is non-convex w.r.t. (G, A) and thus is difficult to solve directly. We next 
describe an approach to convert this problem into an equivalent sum-MSE minimization problem, 
based on which an iterative algorithm similar to Algorithm 1 can be developed. 

B. Conversion to Weighted Sum-MSE Minimization 
Recall that x { ~ CAf(0, 1), i = 1, • • • , K, i.e., 

p(xi) = -e" 1 ^ 2 , i = l,-",K. (25) 

71 

From the equivalent transmission (22), it is readily to obtain the conditional distribution as 

P( z %\ x i) = —^i e S ' 1 > * = 1,---,^, (26) 

where 

E[ = pjFG(HH H - hX)G H F H p t + pjAA H p t + ^pjFGG H F H p t + a 2 pfp t . (27) 

With Bayes' rule, we can obtain a posteriori distribution p(xi\zi), which is also Gaussian and 
is given by 

p(xi\zi) = — — e ^ , i = l,---,K. (28) 

where Wi is a scaling coefficient to be determined, WjZj represents the conditional mean, and Ei 
represents the conditional variance. According to the definition of mutual information [24], the 
rate in (23) can be written as 



Ri = -E 
2 



, p(Xi\Zi 

log 



p(Xi) 



, i = l,---,K, (29) 
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where the expectation is take over the joint distribution of x and z, i.e., p(xi)p(zi\xi). With 
(28) and (25), we can express the sum-rate (24a) as a function of G, {ooi} and {Si}. Similar 
conversions have been previously used in [25], [26] for optimizing beamforming vectors in 
broadcast channels. Based on this conversion, we next establish a relation between the sum-rate 
maximization problem (24) and the weighted sum-MSE minimization problem (4). 

Substituting (28) and (25) into (29), we express the sum rate (24a) as a function of G, fi 
and 27 as 



i=l i=l v 1 



■ + xA 



(30) 



Plugging (21) into (30), the sum rate is rewritten in matrix form as 



K 



E 



E 



27 5 [(/+ fiP T A)x - fiP T r] 



K 



+ ^log^-K 



i=l 



K 



(31a) 
(31b) 



i=i 



where 



£ 

h 
n 



diag{27i,-- - ,E K }, 

diag{27 w -i (1) , • • • , £ n -i {K) } = P£P T , 
diag{o;i, • • • ,uj k }, 

diag{w w -i(i), • • • , uj n -i {K) } = POP T . 



(32a) 
(32b) 
(32c) 
(32d) 



The expectation taken over p(xi)p(zi\Xi) is equivalent to that take over p{xi)p{u n -i^)p{v^-i^) 
since the signal and each noise are independent. In (31b), E||27 _ 2 ((_p + f2A)x — fir) || 2 is 
the same as the weighted sum-MSE in (4a) by letting 2 



W = £~\ B = fiA, and C = fi. 



(33) 



Thus, we can develop an iterative algorithm similar to Algorithm 1 to solve the sum-rate 
maximization problem, as detailed below. 



2 To avoid confusion in establishing this relation, we use A, instead of B, to denote the weights of self-interference in the 
case of sum-rate maximization. 
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C. Iterative Algorithm 
Similarly to (5), we define 



G = a~ l G, A = a~ l A, and ft = aft, 



-l 



(34) 



where a still represents the scaling factor to meet the relay power constraint. With (34) and the 
received signal vector at the users r in (3), the sum rate in (3 lb) is expanded as 



1 



TZ(G,a,A,ft,S) =--Tr IT 1 ( / + ftAA H ft H - 2$l{ftFGH(P + ftA) H } 

2 



+ftFG (HH H + 7 2 I) G H F H ft H + a 2 a - 2 DD H 



Then the problem (24) can be equivalently expressed as 



maximize Tl{G, a, A, ft, S) 

G,a,A,f2,E 



K 



subject to a Tr 



G (HH H + 7 2 I) G 



H 



< Pv 



(35) 

(36a) 
(36b) 



Noting the similarity to problem (4), we develop an iterative algorithm to solve problem (36) as 
follows. 

1) Optimizing (G : a) for fixed (A,D,S): For fixed (A, D, £), the problem in (36) is 
the same as the one in (7) by letting W = £~ l , B = ft A and C = ft except for some 
additive constants. Thus, from Proposition 1, the optimal precoder can be immediately written 

as G opt = aG° pt with 

G° pt = ^yTr[Z- 1 ftft H ]I + F H fi H E- l ftF*j F H ft H IJ- 1 (P+ftA)H H (HH H + 1 2 iy 1 (37a) 
G° pt {HH H + 7 2 I) (G° pt ) H 



a o P t = pa / Tr 



(37b) 

2) Optimizing (A, ft, S) for fixed (G,a): For fixed (G,a), we aim to find the optimal 3- 
tuple (A, ft, S) that maximizes 1Z(A, ft, £). We first determine the optimal A. From (33) and 
(34), we see that (35) is equivalent to (6) by replacing W with -E 1-1 , B with ft A and C with ft. 
Together with the fact that the optimal B for (13) is given in (20), i.e., B opt = [C° pt FGH] diag , 



July 31, 2012 



DRAFT 



14 



the optimal A is given by 

A opt = [FGH] diag . (38) 

With (38), we obtain zl op< = aA° pt = [FGH] diag which consists of the self-interference weights 
in the received signal r. This means that the self-interference is perfectly canceled at the receiver 
ends. (Since self-interference is known precisely, it is rather obvious it should be completely 
canceled.) 

We next determine the optimal fl and 27. Recall that fl and 27 specify the means and variances 
of the Gaussian distributions p(xi\zi) in (28). Given (G,a, A), Zi and %i are linearly related by 
noting z = P T (r — Ax). Thus, from [23], the a posteriori mean and variance are respectively 
given by 

E[x t \z t ] = C XiZ £;X^ (39a) 

CxiXi\zi = CxiXi ~ Cxizfi ZiZ CziXii (39b) 

where the involved covariances are given by 

C XiXi = 1, (40a) 
C ZiXi = apjFGh h (40b) 
C ZiZl = a 2 pjFG(HH H + 1 "l)G H F H p t -a 2 pjAA H Pl + a 2 . (40c) 

Therefore, for fixed (G, a, A), the optimal fl and 27 can be written as 

uf = aC XiZi C;X and Ef = C XiXi]zi , i = 1, • • • , K (41) 

3) The iterative algorithm: The sum-rate optimization problem (24) can be solved by itera- 
tively solving the above two subproblems. The procedure is outlined in the following algorithm. 

Algorithm 2. 

i: Init: A = A , fl = f2 , 27 = 27 ; 

2: while the sum rate can be improved by more than 5 do 

3: Compute G and a using (37); 

4: Compute A, fl and 27 using (38) and (41); 
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5: end while 

Algorithm 2 converges, as the sum rate is bounded and monotonically increases in the iterative 
process. The convergence point depends on the initial point (f2 , S , A ). We will discuss the 
initialization in Section V. 

V. Asymptotic Analysis 

Algorithm 1 and 2 only guarantee local optima of the weighted sum-MSE minimization and 
sum-rate maximization problems. In this section, we carry out asymptotic analysis and show 
that, with proper initialization, the proposed iterative algorithms are asymptotically optimal in 
the low and high SNR regimes. For ease of discussion, we assign equal weights to the weighted 
sum-MSE minimization problem, i.e., W = I. In this way, as will be seen, the solutions to the 
sum-MSE minimization and sum-rate maximization allows unified expressions. Similar unified 
expressions exist for the solutions to the weighted sum-MSE minimization and weighted sum-rate 
maximization problems. The derivations are similar. We leave the details to interested readers. 

A. Low-SNR Analysis 

We start with the low-SNR case. For convenience, we focus on the limit where the noise 
levels a 2 and 7 2 tend to infinity at a same rate, i.e., 

a 2 ,7 2 ^+oo with 7 2 /o- 2 ->■ r, (42) 

where r e M + is an arbitrary positive constant. The main result is summarized as follows, the 
proof of which is given in Appendix A. 
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Theorem 1. In the limit of o" 2 ,7 2 — > +00 with rfjo 2 — > t, the asymptotically optimal 
precoders for sum-MSE minimization in (4) and sum-rate maximization in (24), with and 
without PNC, are identical and can be expressed as 

G° = aG°, (43) 

where G° is such that vec(G°) is an eigenvector corresponding to the maximum eigenvalue 
of 

K 

* = Y^Mi) T ®( FH p*tn (44) 

and scalar a is such that the precoder G° satisfies the power constraint with equality at the 
relay. 



At low SNR, the optimal precoder is identical with and without PNC in the limit. This is not 
surprising because when the noise dominates the received signal, the benefit of self interference 
cancelation is marginal. 

Proposition 3. As cr 2 ,^ 2 — > +00 with 7 2 /<t 2 — > r, Algorithms 1 and 2 converge to the same 
global optimum G° given in (43). 

The proof of Proposition 3 is given in Appendix B. According to Proposition 3, the point 
of convergence of Algorithms 1 and 2 is not sensitive to the initial condition in the low SNR 
regime. 

B. High-SNR Analysis 

In the high SNR regime, we are interested in the limit of 

a 2 ,7 2 ^0 and 7 2 /cr 2 ->■ r. (45) 

The asymptotically optimal precoders are described as follows, where the proof is given in 
Appendix C. 
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Theorem 2. Suppose N > K. In the limit of a 2 ,'-/ 2 — > with 7 2 '/a 2 — > t, the asymptotically 
optimal precoders for sum-MSE minimization and sum-rate maximization, with and without 
PNC, are identical and can be expressed as 

G°° = F ] C-\P + B)H\ (46) 

where C G C KxK is diagonal, and B e C KxK is an all-zero matrix in the non-PNC case 
and is a diagonal matrix in the PNC case. 



It has been shown in [27] that the precoder in (46) forces all the interference to zero, and 
hence is referred to as the zero-forcing precoder. Theorem 2 reveals that zero-forcing precoding is 
asymptotically optimal when the relay has no fewer antennas than the number of users (N > K). 
Otherwise, the relay does not have enough degree of freedom to force interference seen by all 
users to be zero. In this case, we may schedule fewer users to allow zero forcing. Then, Theorem 
2 can be applied to yield the optimal precoder. 

Proposition 4. Suppose N > K. As a 2 ,^ 2 — >■ with 7 2 / " 2 — > t, Algorithms 1 and 2 converge 
to G = F^C l (P + B )H ] for any initial values B = B and C = C . 

Proposition 4 suggests that, in the high SNR regime, the convergence points of Algorithms 1 
and 2 highly depend on the initial conditions. Therefore, it is necessary to carefully choose B 
and C in the high SNR regime, as detailed below. 

1 ) Sum-MSE Minimization: Plugging in the optimal precoder (46) and ignoring the high-order 
infinitesimals, we can rewrite the sum-MSE minimization problem in (4) (with W = I) as 

minimize Tr [ 7 2 (P + B) (H H H)- l {P + B) H + a 2 CC H ] (47a) 

subject to Tr[(FF H )- 1 C- 1 (P + B)(P + B) H C- H ] < P r (47b) 
B and C are diagonal. (47c) 

a) Non-PNC Scheme: In this case, we have B = 0. Then the optimal precoder is given in 
the following proposition with the proof in Appendix E. 
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Proposition 5. For the non-PNC case, i.e., B = 0, the optimal solution to (47) is given by C°° 
with the ith diagonal element being 

^Ev 7 /^) > i = l,---,K, (48) 

where is element of (FF H )~ l , and hij is element of (H H JET) -1 . 

b) PNC Scheme: In this case, B is allowed to be non-zero. Then, the problem in (47) is in 
general non-convex and is difficult to solve. In the following, we propose a suboptimal solution 
to iteratively optimize B and C. Define the index mapping j — % — 1, • • • , K. 

Proposition 6. (i) Given {cf\, the problem (47) is a convex problem of {bj} with the optimal 
solution 

6. = - _ 2 3 , i = l,-",K, (49) 

where A is a scalar to meet the relay power constraint, (ii) Given {b 3 }, the optimal phases of 
{cj} to the problem (47) are given by 

Zcj = Zc, - Af 3l - Z.b, - 7r, j = 1, • • • , K. (50) 

(Hi) Given {bf\ and the phases of {c^} in (50), the problem (47) is convex in {|cj|~ 2 }. 

Proposition 6 is proved in Appendix F. With Proposition 6, we can readily develop an algorithm 
to iteratively optimize {&,} and {c^}. We omit the details for simplicity. 

The phase-aligned algorithm in [17] is the special case of pairwise traffic pattern assuming 
the uplink and downlink channel are reciprocal. The phase setting (50) generalizes the previous 
result by relaxing the assumptions of pairwise pattern and channel reciprocity. 

2) Sum-Rate Maximization: Next, we investigate the high-SNR asymptotically optimal ma- 
trices B and C for sum-rate maximization. Plugging into the optimal form (46) and ignoring 
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the high-order infinitesimals, we rewrite (24) as 



maximize 



{bj},{cj} 



2 U 7 2 M~ 2 [KM 2 + 2»{ W + K] + 
Tr[(FF H )- 1 C-\P + B)(P + B) H C' H ] < P r . 




(51a) 



subject to 



(51b) 



a) Non-PNC Scheme: In this case, we have B = 0. The optimal precoder is given by the 
following proposition and the proof is given in Appendix G. 

Proposition 7. For B = 0, the optimal solution to (51) is written as 



where X is a scaling factor to guarantee that the relay transmits with its maximum power. 

b) PNC Scheme: In this case, B is in general non-zero. Similarly to the case of sum-MSE 
minimization, the joint optimization of B and C is difficult to solve. We next find a suboptimal 
solution by iteratively optimizing {b 3 } and {c 3 }. Similarly to Proposition 6, we have the following 
results, with the proof of which is given in Appendix H. 

Proposition 8. (i) Given {c 3 }, the problem in (51) has a closed-form optimal solution {bj}. (ii) 
Given {b 3 }, the optimal phases of {c 3 } to the problem (51) are given by (50). (Hi) Given {b 3 } 
and the phases of {c 3 } in (50), the problem (47) is convex in {|c.,| 2 }. 

To summarize, our asymptotic analysis reveals that the proposed iterative algorithms in Section 
III converge to the asymptotically optimal solution in (43) at low SNR, and that this convergence 
is insensitive to the initial conditions. At high SNR, Algorithms 1 and 2 converge to the asymp- 
totically optimal zero-forcing form in (46), but could perform poorly depending on the initial 
conditions of B and C. Therefore, in implementation, we set the initial values of Algorithms 1 
and 2 to the high-SNR optimal/suboptimal solutions of B and C (cf. Propositions 5 ~ 8). The 
performance of these algorithms are presented in the next section. 
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Fig. 2. The sum MSE comparison of different relaying schemes in the cases of four users. 



VI. Numerical Results 

We evaluate the sum MSE and the sum rate of the proposed MIMO switching schemes. In 
simulations, we assume unit transmit power for each user and the relay, i.e., P r = 1, and the 
noise levels at the relay and at the users are the same, i.e., a 2 = 7 2 . The channel SNR is then 
defined as SNR = l/a 2 = I/7 2 . Numerical results not presented here indicates that the system 
performance is not sensitive to the switching pattern. Thus, we only present the numerical results 
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The key findings are summarized as three observations below. 

Observation 1: The proposed iterative sum-MSE minimization (It-MSE-Min) algorithm performs 
significantly better than the MMSE schemes in [12], [15]. 

The MSE performance of the related schemes is illustrated in Fig. 2. The sum MSE of the 
MMSE scheme [12], [15] saturates at high SNR (say, SNR > 10 dB), since this scheme optimizes 
the precoder at the relay only. In contrast, the proposed It-MSE-Min algorithm jointly optimizes 
both the precoder at the relay and the receive filters at the users, resulting in vanishing MSE at 
high SNR. Moreover, the sum MSE can be further reduced by exploiting the PNC technique. 
From Fig. 2, the PNC gain is as significant as 6 dB at the sum MSE of 1CT 2 . 

Observation 2: The iterative sum-rate maximization (It-Rate-Max) algorithm achieves significant 
throughput gains over the existing relaying schemes, such as ZF/MMSE relaying [12], [15] and 
the network-coded relaying [13], [15]. 

Fig. 3 illustrates the throughput performance of various approaches including the proposed 
iterative sum-rate maximization scheme with PNC (It-Max-Rate PNC), the zero-forcing scheme 
without PNC (ZF non-PNC) in [12], [15], the MMSE scheme without PNC (MMSE non-PNC) 
in [12], [15], the balanced PNC scheme proposed in [15], 3 and the zero-forcing scheme with 
PNC (ZF PNC) in [17]. From Fig. 3, the proposed It-Rate-Max PNC algorithm significantly 
outperforms the other schemes throughout the SNR range of interest. Specifically, the proposed 
algorithm outperforms the MMSE non-PNC scheme, especially in the high SNR regime, since 
the former utilizes the PNC technique and jointly optimizes the precoder and the receive filter. 
The proposed algorithm also outperforms the zero-forcing schemes in [12], [15], [17], since 
the latters suffer from noise amplification. Furthermore, we also see that the proposed iterative 

3 A PNC scheme was proposed in [13] as well, which used the same block-diagonalization technique as that in [15]. However, 
the scheme in [15] induced an extra step to balance the channel gain of each user, which outperformed the scheme in [13]. 
Thus, we show the result of [15] in Fig. 3 only. 
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SNR(dB) 

Fig. 3. Throughput comparison of different relaying schemes in the cases of four users. 



max-rate scheme has roughly 1.5 dB gain over the balanced PNC scheme in [15] throughout 
the whole SNR range of interest. 

Observation 3: The PNC schemes achieve considerably higher throughputs than their corre- 
sponding non-PNC schemes, especially at medium and high SNR. 

Fig. 4 illustrates the PNC gain for the proposed It-MSE-Min/It-Rate-Max approaches, as well 
as for the zero-forcing relaying schemes in [10], [17]. At low SNR, the proposed It-MSE-Min/It- 
Rate-Max algorithms with and without PNC, exhibit roughly the same throughput performance, 
which numerically verifies Theorem 1. At high SNR, the proposed schemes with PNC achieve 
about 6 dB gain over the best non-PNC schemes (i.e., the It-Rate-Max scheme without PNC) 
at the sum rate of 8 bits per symbol period, which is the duration of transmitting a modulated 
symbol in the uplink/downlink. It is also interesting to see that, at high SNR, the proposed 
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SNR(dB) 

Fig. 4. Throughput comparison of network-coded relaying schemes and their non-PNC relaying schemes in the cases of four 
users. 



It-MSE-Min/It-Rate-Max algorithms exhibit similar throughput performance, either in the PNC 
case or the non-PNC case. This agrees well with the fact in Theorems 1 and 2 that the optimal 
precoders for It-MSE-Min/It-Rate-Max coincide, except that the asymptotically optimal diagonal 
matrices B and C are slightly different (cf. Proposition 4~8). 

VII. Conclusion 

In this paper, we proposed a unified approach to iteratively solve the sum-MSE minimization 
and sum-rate maximization problems for the wireless MEMO switching networks with and 
without PNC. We proved that although the proposed algorithms are suboptimal in general, 
they can converge to asymptotically optimal/suboptimal solutions in both the low and high 
SNR regimes. Numerical results demonstrate that the proposed iterative algorithms significantly 
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outperform the existing ZF and MMSE relaying schemes for MIMO switching. 

This paper makes several assumptions to simplify the design and analysis of the MIMO 
switching schemes. For example, we assume that each user has a single antenna, and full channel 
state information is available to the relay. It will be of theoretical and practical interest to 
investigate the impact of relaxing there assumptions on the MIMO switching design. We will 
look into these issues in future work. 



Appendix A 
Proof of Theorem 1 

We first consider sum-MSE minimization. When a 2 — > +00 and 7 2 / " 2 — > r, the scaling factor 
a in (12b) can be approximated as 



a 



7 2 Tr[GG 

Similarly, the optimal B and C in (14) are approximated as 

rH G H F H ] 

a 2 a- 2 [PH H G n F H ] diag + 0(a 



B* = a 2 a- 2 [FGH] dmg [PH H G H F H } di!lg + 0(a- 4 ), 



C 



opt 



Then, the sum MSE (6) is rewritten as 



J(G,a,B,C) = Tr[I -2$t{a 2 a- 2 [PH H G F^U^FGHP 1 

+ 1 2 a^[PH H G H F H ] diag FGG H F H [PH H G H F H ' 



diag 



+a 2 a- 2 [PH H G H F H ] diag [PH H G F 1 



J diag 



+ 0(<7- 



K + Tr 



+a 2 a- 2 [PH H G H F H ] diag [FGHP J 



J diag 



+ 0(a 



= K-Tr 



a 2 a- 2 [PH H G H F H ] dmg [FGHP T \ 



diag 



+ 0(a 



(53) 



(54a) 
(54b) 



(55a) 



-23? \a 2 <j- 2 [PH H G H F H ] diag FGHP T } (55b) 



K - Tr [a 2 a- 2 [H H G H F H P] diag [P T FGH] diag \ + O^ 6 ) 

K 

K -a- 2 J2\pJ FGh i\ 2 + 0{a- Q ), (55c) 
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where (55a) follows by plugging in (54a) and (54b); (55b) follows from (53); (55c) follows from 
(5). 

Now we consider the sum-rate in (24a). In the low SNR regime, this sum rate becomes first 
order approximated as 

f> = ^f>^G^| 2 + 0(a- 4 ). (56) 

From (55c) and (56), we see that minimizing the sum MSE is equivalent to maximizing the sum 
rate in the low-SNR regime, with the solution given by solving 

K 

maximize ^ \pjFGhj\ 2 (57a) 
subject to Tr[GG H ] < ^. (57b) 

Note that 

\pjFGh £ \ 2 = Tr[pjFGh e h?G H F H p £ ] 
= Tr[G H F H p e pj F Ghfhf] 
= g H vec(F H p e pjFGh e hf) 

= g H ((h i h?)®(F H p e pjF))g, (58) 

where g = vec(G), and the last step follows from vec(ABC) = (C <E> A)vec(B) in Lemma 
4.3.1 [21]. With the definition in (44), the problem in (57) can be equivalently written as 

maximize g H ^g (59a) 

9 

p 

subject to g H g < (59b) 

7 2 

The optimal g to the above problem is an eigenvector corresponding to the maximum eigenvalue 
of \P, which concludes the proof. 
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Appendix B 
Proof of Proposition 3 

We first consider the convergence point of Algorithm 1. Clearly, any convergence point 
of Algorithm 1 satisfies both (12) and (14). In the low SNR regime, (14) can be first-order 
approximated as (54). Plugging in (54), (12a) can be first-order approximated as 



G 



1 



a -Tr[CC H ] 
Tx[GG H ] 



F H C H PH H 



F [FGHP \diagPH 



H 



\\diag{PH H G H F H }\\ 2 F 
= iF H [FGHP T ] di&g PH H 

= £F H P[P T FGH] diag H H . 
In the above, (60a) follows by noting that as a 2 — y +oo and 7 2 / " 2 — > t, 



(60a) 

(60b) 

(60c) 
(60d) 



^ Tr[CC H ]I + F H C H WCF 

P + B 

mi" + 7 2 / 



1 2 I. 



(61a) 
(61b) 
(61c) 



(60b) follows by plugging in (53) and (54b). (60c) follows by defining 

Tr\GG H ] 



(62) 



u diag{PH H G H F H }\\ 2 ' 

(60d) follows from P T [A] di£Lg P = [P T AP] diag . Let f e to be column £ of F H P. Recall that 
hg is column £ of H, i = 1, • • • , K. We obtain 

H 



F H P[P T FGH} diag H H =[ fl ... f K ] diag{/f Gh u • • • , f* Gh K } 

K 



h 



K 



if) 



J2(F H PeP jF)G(h e hf) 



&9 



(63a) 
(63b) 

(63c) 
(63d) 
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where the last step follows from vec(ABC) = (C T <g> A)vec(B) and the definition in (44). 
With (63d), the optimal precoder in (60d) is rewritten as 

9 = &9, (64) 

implying that the convergence point of Algorithm 1 must be one of the eigenvectors of \P. 
Furthermore, the iterative process of Algorithm 1 is equivalent to recursively calculate \Pg and 
then update g with \Pg normalized by £. This process is called power iteration in linear algebra 
[28], with the fixed point given by the eigenvector corresponding to the largest eigenvalue of \P '. 
Thus, Algorithm 1 converges to the optimal solution in (43). 

Next, we consider the convergence point of Algorithm 2. We aim to show that this convergence 
point is also given by (64). To this end, we note that the equivalence between the sum-rate 
maximization problem and the weighted sum-MSE minimization problem is established based 
on (33) as pointed out in Section IV.B. We also note that the convergence point of the sum-MSE 
minimization problem (i.e., XV = I) is given by (64). Therefore, it suffices to show that, in the 
low SNR regime, the sum-rate maximization is equivalent to the sum-MSE minimization. From 
(33), we thus need to show that W = S^ 1 I in the low SNR regime, as detailed below. 

When a 2 — > +oo and 7 2 / " 2 — > r, the scaling factor a can be approximated as in (53), and 
hence the covariance (40c) can be approximated as 

C ZiZi » a 2 . (65) 

Together with (40b) and (53), we have 

Ei w 1 - a 2 (pjFGhi)*a- 2 a 2 pjFGhi « 1, i = 1, • • • , K. (66) 

or equivalently, 

S w I, (67) 

which completes the proof. 
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Appendix C 
Proof of Theorem 2 

Recall that F is a K-by-N matrix. Then 

(f h C H WCF^j = F^C\C H )\F H )\ (68) 

When a 2 — > and 7 2 /(x 2 — > r, together with W = I, the precoder in (12a) reduces to 

G opt = aF ] C~ l (P + B)H ] . (69) 

From Proposition 1, the precoder in (69) is the optimal solution to (4) for any given B and C. 
Since B and C are arbitrary, we conclude that the asymptotically optimal precoder for sum-MSE 
minimization is given by (69). 

Similarly, the asymptotically optimal precoder for sum-rate maximization can be obtained by 
letting o 2 — y and 7 2 /(x 2 — > r in (37), yielding 

G opt = aF iQ-\p + QA)H ] . (70) 
Thus, we see that the theorem is readily proved by letting C = fl and B = flA. 

Appendix D 
Proof of Proposition 4 

We first see the high-SNR approximation of the solution (12a), (14a) and (14b) for sum-MSE 
minimization problem. When a 2 — > and 7 2 /<r 2 — > r, from (12a), the optimal solution of G 
can be rewritten as 

G opt = F ] C-\P + B)H ] . (71) 

Plugging (71) into (14a) and (14b), we have 

B opt = B and C opt = C. (72) 

This result indicates that the solution of Algorithm 1 is trapped at the initial value at high SNR. 
Next, we see the high-SNR approximation of the solution (37) and (41). Plugging (46) into 
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(41), we have 

= ooi and E° pt ->■ 0, i = 1, • • • , X. (73) 

Note that the term TrflT 1 J?/^] in (37) is still finite due to a 2 ->■ 0. This result indicates that 
the solution of Algorithm 2 is also trapped at the initial condition. Thus, the proof is completed. 

Appendix E 
Proof of Proposition 5 

Plugging in (46), the sum MSE in (6) can be rewritten as 

J = 1 2 P{H H H)- 1 P T + a 2 CC H , (74) 
and the relay power consumption in (7b) is rewritten as 

Tr[G (HH H + 7 2 /) G H \ « Tr[GHH H G H ] = Tr[(FF H )- 1 CC H ]. (75) 
Then, the problem (4) can be minimized equivalently by solving 

K 

]TM 2 (76a) 



minimize 

ci,— ,ck 



i=\ 

K 

subject to V < P r , (76b) 

t=i 1 *' 

where q is the £th diagonal element of C. It can be readily verified that problem (76) is convex. 
Solving the KKT conditions [29], we obtain the solution in (48). 

Appendix F 
Proof of Proposition 6 

We start with the proof of part (i). The problem (47) can be equivalently expressed as 

K 

minimize V] (7 2/ %N 2 + 2 7 2 ^{A* A} + o- 2 |q| 2 + ^ 2 h u ) (77a) 

6l,-" f>K ,C1,— ,CK '~ / 
1=1 

K 

subject to (fM~ 2 + fM~*\ b *\ 2 + /j^rVk + U<Y l c; l K < Pr) ■ (77b) 

1=1 
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For fixed {a,}, both the objective function (77a) and the constraint (77b) are quadratic functions 
of {bj}. Hence it is straightforward to see that the problem (77) (or equivalently (47)) is convex 
in {bj}. By the Lagrangian method, The optimal {b 3 } can be obtained as 

_ 7V ( ,),, + A/^p-y 

l ~ +\f id- 2 — ' l - 1 >"-> iir > ™ 

I It'll \ 

where A is a scalar to meet the relay power constraint. 

Next, we prove part (ii) and (iii). Since the sum MSE (77a) is independent of the phases of 
{cj}, the optimal phases must minimize the relay power consumption in (77b). Note that (77b) 
can be rewritten as 

K 

(IS"' + fjibiC^l 2 + (fn + f u \b % \ 2 - l/jAHld- 2 - \Cj\~ 2 ) < Pr- (79) 

i=l 

Therefore, for any given cj 1 and /^Ac" 1 have opposite phases, or equivalently, the optimal 
phases of {c,} satisfy 

Zc 3 =Z Cl - Zf Jt - Z6, - 7T, j = 1, • • • , K. (80) 
With (80), the relay power consumption in (79) reduces to 

K 

(fM~ 2 + /«lcl" 2 IM 2 - 2|^||c,|- 1 |c,|- 1 |6,|) . (81) 

i=i 

Let 77, = Id -2 , % = 1, • • • ,K. Clearly, the objective (77a) is convex in {q l }. In addition, 
I c jI _1 | c i| _1 = \fWh-> me geometric mean of r\ 3 and r\ % , is concave in r\ 3 and rj t [29]. As non- 
negative weighted sums preserve concavity, Ylf=i 2 1 fji I y/Wh is concave in {^}, and hence the 
power in (81) is convex in {^}. Therefore, the problem (77) is a convex problem of {r] t }, which 
concludes the proof. 
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Appendix G 
Proof of Proposition 7 

When a 2 — > and 7 2 /(7 2 — > t, the sum rate (24a) is approximated as 

K 





pjFGh t 


2 


7 2| 


p?FG 


2 + a 2 



The constraint of the relay power consumption (24b) is approximated as 



K 

Tr[GHH H G H ] = ]T f J3 \ Cj \- 2 < P r , (83) 
We obtain the solution in (52) by solveing the KKT conditions. 

Appendix H 
Proof of Proposition 8 

The proof of Proposition 8 is similar to that of Proposition 6. The sketch of the proof is given 
as follows. We start with the proof of part (i). For fixed {cj}, we use the Lagrangian method to 
solve the problem (51). The Lagrangian function is written as 



1 K 



-2 



+a [J2fM~ 2 + fM~ 2 N 2 + fAc^'W + UKr^b: - p r )(84) 



,i=i 



Setting the derivatives w.r.t. {b t } to zero, we obtain 

(l 2 h n \b t \ 2 + 2<y 2 $i{hl- 1{l)l b l } + 7 2 /i ff -i W)W -i W + ct 2 |c j | 2 )(7 2 /i m 6 j + 7 2 /i 7r -i( l ), J ) (85) 

+ A (/ M | Cl |- 2 6, + ^(c:)- 1 ^ 1 ) = 0. (86) 

The above is a cubic equation w.r.t. b p which allows closed form solution [30]. 

We next prove part (ii) and (iii). Since the sum rate in (51a) is independent of the phases of 
{cj} and problem (51) has the same constraint as problem (77), we conclude that the optimal 
phases of {c 3 } also satisfy (80). Moreover, let r] t = \c t \~ 2 , % — 1, • • • , K. It can be shown that 
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the constraint (51b) is convex in {r] l }, and that the objective (51a) is concave in {r] l }. Thus, the 
problem (51) is a convex problem in {rj t }, which complete the proof. 
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